Reconstructing audio signals from modified non-coherent hilbert envelopes
نویسندگان
چکیده
In this paper, we present a speech and audio analysis-synthesis method based on a Basilar Membrane (BM) model. The audio signal is represented in this method by the Hilbert envelopes of the responses to complex gammatone filters uniformally spaced on a critical band scale. We show that for speech and audio signals, a perceptually equivalent signal can be reconstructed from the envelopes alone by an iterative procedure that estimates the associated carrier for the envelopes. The rate requirement of the envelope information is reduced by low-pass filtering and sampling, and it is shown that it is possible to recover a signal without audible distortion from the sampled envelopes. This may lead to improved perceptual coding methods.
منابع مشابه
Modified Discrete Cosine Transform for Encoding Residual Signals in Frequency Domain Linear Prediction
Frequency Domain Linear Prediction (FDLP) uses auto-regressive models to represent Hilbert envelopes of relatively long segments of speech/audio signals. Although the basic FDLP audio codec achieves good quality of the reconstructed signal at high bit-rates, there is a need for scaling to lower bit-rates without degrading the reconstruction quality. Here, we present a method for improving the c...
متن کاملMdct for Encoding Residual Signals in Frequency Domain Linear Prediction
Frequency domain linear prediction (FDLP) uses autoregressive models to represent Hilbert envelopes of relatively long segments of speech/audio signals. Although the basic FDLP audio codec achieves good quality of the reconstructed signal at high bit-rates, there is a need for scaling to lower bit-rates without degrading the reconstruction quality. Here, we present a method for improving the co...
متن کاملAudio Coding Based on Long Temporal Contexts
We describe novel audio coding technique designed to be utilized at medium bit-rates. Unlike classical state-of-the-art audio coders that are based on short-term spectra, our approach uses relatively long temporal segments of audio signal in critical-band-sized sub-bands. We apply auto-regressive model to approximate Hilbert envelopes in frequency sub-bands. Residual signals (Hilbert carriers) ...
متن کاملNon-uniform Speech/Audio Coding Exploiting Predictability of Temporal Evolution of Spectral Envelopes
We describe novel speech/audio coding technique designed to operate at medium bit-rates. Unlike classical state-of-the-art coders that are based on short-term spectra, our approach uses relatively long temporal segments of audio signal in critical-band-sized sub-bands. We apply auto-regressive model to approximate Hilbert envelopes in frequency sub-bands. Residual signals (Hilbert carriers) are...
متن کاملA Novel DOA Estimation Approach for Unknown Coherent Source Groups with Coherent Signals
In this paper, a new combination of Minimum Description Length (MDL) or Eigenvalue Gradient Method (EGM), Joint Approximate Diagonalization of Eigenmatrices (JADE) and Modified Forward-Backward Linear Prediction (MFBLP) algorithms is proposed which determines the number of non-coherent source groups and estimates the Direction Of Arrivals (DOAs) of coherent signals in each group. First, the MDL...
متن کامل